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Molecular mimicry is an attractive mechanism for triggering 
autoimmunity. In this review, we explore the potential role of 
evolutionary conserved bacterial proteins in the production 
of autoantibodies with focus on granulomatosis with poly- 
angiitis (GPA) and rheumatoid arthritis (RA). Seven auto- 
antigens characterized in GPA and RA were BLASTed 
against a bacterial protein database. Of the seven auto- 
antigens, proteinase 3, type II collagen, binding immunoglo- 
bulin protein, glucose-6-phosphate isomerase, a-enolase, 
and heterogeneous nuclear ribonuclear protein have well- 
conserved bacterial orthologs. Importantly, those bacterial 
orthologs are also found in human-associated bacteria. The 
wide distribution of the highly conserved stress proteins or 
enzymes among the members of the normal flora and com- 
mon infectious microorganisms raises a new question on how 
cross-reactive autoantibodies are not produced during the 
immune response to these bacteria in most healthy people. 
Understanding the mechanisms that deselect auto-reactive 
B cell clones during the germinal center reaction to homolo- 
gous foreign antigens may provide a novel strategy to treat 
autoimmune diseases. 
[Immune Network 2014;14(1):7-13] 



INTRODUCTION 

Autoimmune diseases are complex immune-mediated dis- 
eases that involve both genetic and environmental factors in 
their pathogenesis. Infectious microorganisms have long been 
suggested to trigger an immune response to autoantigens by 
providing stimuli for the breakdown of self-tolerance and also 
by generating cross-reactive T cells and antibodies via molec- 
ular mimicry. Molecular mimicry is a mechanism that has a 
proposed role in many autoimmune diseases such as acute 
rheumatic fever, rheumatoid arthritis, Guillain-Barre syn- 
drome, multiple sclerosis, type 1 diabetes mellitus, and Lyme 
arthritis. In autoimmune diseases, the concept of molecular 
mimicry has often been used to describe similar structures 
shared by molecules from dissimilar proteins, as illustrated by 
the a -helical coiled-coil streptococcal M protein and cardiac 
myosin in rheumatic fever. However, some proteins such as 
heat shock proteins are evolutionally highly conserved from 
prokaiyotes to eukaryotes. In this review, we explore the po- 
tential role of the evolutionary conserved bacterial proteins 
in the production of autoantibodies with focus on gran- 
ulomatosis with polyangiitis (GPA) and rheumatoid arthritis 
(RA). 
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Table 1. Selected bacteria that contain 


human PR3-homologous proteins 








Bacteria 


Protein 


Query cover (%) 


Identities (%) 


Similarities (%) 


Myxococcus xanthus 


S1 A family peptidase 


88 


35 


45 


Actinosynnema mirum 


S1 and S6 peptidase 


92 


34 


47 


Streptomyces clavuligerus 


Trypsin 


83 


35 


48 


Pseudoalteromonas tunicata 


Secreted trypsin-like serine protease 


91 


33 


48 


Saccharomonospora viridis 


Secreted trypsin-like serine protease 


93 


31 


44 


Saccharopolyspora erythraea 


Secreted trypsin-like serine protease 


95 


30 


43 


Vibrio cholerae 


Serine protease 


87 


27 


46 


Vibrio parahaemolyticus 


Trypsin family protein 


78 


31 


46 



GPA AND ANTI-NEUTROPHIL CYTOPLASMIC 
AUTOANTIBODIES (ANCA) 

GPA (Wegener's) is a type of ANCA-associated vasculitis that 
affects small- and medium-sized vessels in many organs. Its 
clinical symptoms include fever, fatigue, weight loss, nasal 
discharge, sinusitis, cough, dyspnea, hematuria, and protein- 
uria. Pathologically, GPA is characterized by multi-focal gran- 
ulomatous inflammation with central necrosis and necrotizing 
vasculitis. In addition, the presence of an ANCA in serum is 
used as a diagnostic marker. There are two types of ANCAs, 
i e. , directed against either proteinase 3 (PR3-ANCA) or mye- 
loperoxidase (MPO-ANCA). The ANCA antigen specificity of 
GPA in European patients is predominantly PR3, whereas that 
in Japanese patients is predominantly MPO. The pathogenesis 
of necrotizing vasculitis is understood to involve infiltration 
of neutrophils into vessel walls and their subsequent activ- 
ation. ANCA is known to mediate the activation of neu- 
trophils and degranulation via crosslinking of the Fc receptor 
and antigens expressed on the membrane. However, what 
causes the production of ANCA and granulomatous inflam- 
mation, is not known. 

Interestingly, bacterial infection has been implicated in the 
initiation and relapse of GPA. Early observations suggested 
a potential role of infection in the relapse of disease. Chronic 
nasal carriage of Staphylococcus aureus is increased in GPA 
patients compared to healthy subjects (63% vs. 25%), and the 
carriage of S. aureus increases the risk of relapse by 7. 16 fold 

(9) . Furthermore, the addition of trimethoprim/sulfamethox- 
azole to maintenance treatment reduced the relapses by 60% 

(10) . Although the underlying mechanisms for the increased 
relapse by S. aureus is not clear, potential roles in polyclonal 
activation of B cells, priming of neutrophils, and induction 
of anti-idiotypic antibodies to PR3-ANCA have been suggested 



(6). Kain et al. reported a new ANCA directed against lysoso- 
mal membrane glycoprotein 2 (LAMP-2) as a specific marker 
for focal necrotizing glomerulonephritis and showed that the 
autoantibodies to LAMP-2 can be induced by immunization 
with FimH, a bacterial fimbrial adhesin of Gram-negative bac- 
teria (11). The eight amino acids of one LAMP-2 epitope 
(P41-49) recognized by autoantibodies have a strong homology 
with the FimH of several common Gram-negative species, 
suggesting molecular mimicry between the two proteins. 

We previously demonstrated that the membrane bound 
PR3 on neutrophils acts as a receptor for non-opsonic phag- 
ocytosis of bacteria and that the neutralization of PR3 with 
ANCA reduces both binding and phagocytosis of bacteria 
(12). It raised the possibility that ANCA may be induced by 
some pathogens possessing a PR3-homologous protein to 
avoid host immunity. When the bacterial protein database 
was searched using the PR3 protein sequence as a query, 
hundreds of bacterial proteases with 28% to 36% identity 
were indeed found. Among the bacteria containing PR3-ho- 
mologous proteases, only Vibrio cholerae, V. vulnificus, V. 
parahaemolyticus, and Saccharomonospora viridis are known 
to infect humans (Table I). The spores of S. viridis, a 
gram-negative bacterium frequently found in hot compost and 
hay, can be readily dispersed in air. Prolonged exposure to 
those spores can cause farmer's lung, bagassosis, and humidi- 
fier fever that manifest symptoms such as fever, malaise, 
cough, and dyspnea similar to GPA (13). Granulomas are 
usually formed in an attempt to segregate foreign substances 
that are resistant to phagocytic clearance. If S. viridis induces 
ANCA production via molecular mimicry, the ANCA would 
inhibit phagocytosis of S. viridis. Furthermore, if S. viridis sur- 
vives after phagocytosis, infection with S. viridis may also 
contribute to the formation of granulomatous inflammation. 
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AUTOANTIGENS CHARACTERIZED IN 
RHEUMATOID ARTHRITIS (RA) 

A literature search revealed that Wegner et al. already pro- 
posed evolutionarily conserved antigens as stimuli to cause 
breakdown of tolerance and reported well-conserved bacte- 
rial orthologs for pyruvate dehydrogenase complex E2, gluta- 
mic acid decarboxylase, histidyl-tRNA synthetase, and enolase 
among seven major autoantigens examined (14). Therefore, 
we further explored the possibility that evolutionarily con- 
served bacterial proteins are involved in autoantibody pro- 
duction in RA, the most common systemic autoimmune 
disease. 

RA is characterized by the presence of diverse autoanti- 
bodies in serum and synovial fluid. Rheumatoid factor (RF) 
is the first autoantibody described in RA, which is directed 
against the Fc portion of IgG, a major serum component (15). 
Together with the RF, anti-citrullinated protein antibodies are 
clinically important. The anti-citrullinated protein antibodies 
are a group of autoantibodies that recognize citrulline-con- 
taining peptides/proteins as common antigenic epitopes (16). 
Other antigens characterized as targets of autoantibodies in 



RA include type II collagen (CII), binding immunoglobulin 
protein (BiP), glucose-6-phosphate isomerase (G6PI), ff-eno- 
lase, and heterogeneous nuclear ribonuclear protein (hnRNP) 
A2 (17). We searched the bacterial protein database using 
these human proteins as a query. 

Members of the immunoglobulin superfamily have also 
been found in bacteria (18). Homology search with the se- 
quence of human immunoglobulin gamma- 1 heavy chain 
constant region did not retrieve any bacterial proteins. When 
the CH2 and CH3 domains, the antigenic epitopes of RF, 
were searched separately, however, several bacterial proteins 
such as the cell wall binding repeat 2 family protein and cell 
surface protein of Clostridium difficile were found to share 
46% similarities with the CH2 over 76% of the domain. 

CII is a component of both articular and hyaline cartilage. 
RA patients often contain high titers of anti-CII antibodies in 
sera and synovial fluids (19). Because CII is exclusively ex- 
pressed in the cartilage, autoantibodies against CII induce 
joint destruction. Surprisingly, a large number of bacterial 
species contain collagen triple helix repeat family proteins 
that share a high degree of identity (32~47%) and homology 
(36 ~ 52%) with human CII. Among those, important human 



Table II. Selected human-associated bacteria that contain human Cll-homologous proteins 



Bacteria (Human association) 



Protein 



Query Identities Similarities 
cover (%) (%) (%) 



Clostridium beijerinckii (gut) 


Triple helix repeat-containing collagen 


75 


34 


39 


Bacillus cereus* (infection in gut, eye, lung) 


Collagen triple helix 


repeat domain protein 


76 


40 


45 


Bacillus thuringiensis (infection in gut, lung) 


Triple helix repeat-containing collagen 


92 


40 


46 


Brevibacillus brevis (peritonitis) 


Collagen like protein 


65 


42 


49 


Clostridium difficile* (infection in gut) 


Collagen triple helix 


repeat family protein 


78 


42 


46 


*Bacteria that have been reported to develop reactive arthritis, septic arthritis, or rheumatic symptoms. 






Table III. Selected human-associated bacteria that contain human BiP-homologous proteins 








Bacteria (Human association) 


Protein 


Query cover (%) 


Identities (%) 




Similarities (%) 


Bartonella rochalimae* (bacteremia) 


HSP 70 DnaK 


95 


53 




69 


Brucella abortus* (brucellosis) 


DnaK 


92 


54 




70 


Borrelia burgdorferi* (lyme disease) 


DnaK 


93 


53 




70 


Rickettsia felis* (spotted fever) 


DnaK 


95 


52 




68 


Prevotella oralis (oral cavity) 


DnaK 


94 


50 




68 


Atopobium parvulum (oral cavity) 


DnaK 


95 


51 




68 


Prevotella veroralis (oral cavity) 


DnaK 


95 


50 




67 


Prevotella nigrescens (oral cavity) 


DnaK 


94 


50 




67 


Prevotella denticola (oral cavity) 


DnaK 


95 


49 




67 



*Bacteria that have been reported to develop reactive arthritis, septic arthritis, or rheumatic symptoms. 
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pathogens such as Clostridium difficile and Bacillus cereus 
(20,21) and a member of normal gut flora Q beijerinckii (22) 
are included (Table II). 

BiP (also known as 78 kDa glucose-regulated protein or 
heat shock 70 kDa protein 5), is a stress protein located in 
the endoplasmic reticulum. Stress proteins are evolutionally 
highly conserved from bacteria to human. Furthermore, mi- 
crobial stress proteins are highly immunogenic (23). Therefore, 
cellular and humoral immune responses initiated by microbial 
stress proteins may target cross-reactive self proteins, resulting 
in autoimmunity (24). Among the hundreds of bacterial spe- 
cies that contain a BiP-homologous stress protein DnaK, hu- 
man pathogens such as Bartonella spp., Brucella spp., 
Borrelia spp. , and Rickettsia spp, were found. In addition to 



the pathogenic bacteria, a number of bacteria in the human 
oral flora such as Prevotella spp. and Atopobium parvulum 
contained the BiP-homologous DnaK. Selective examples of 
bacterial DnaK listed in Table III have 49 ~ 54% identities and 
67 ~ 70% similarities with the human BiP over 92% of the en- 
tire protein. 

Glucose-6-phosphate isomerase (G6PI) is a ubiquitously 
expressed glycolytic enzyme that is also highly conserved 
through evolution. Selected bacterial G6PIs for several hu- 
man-associated species present even higher identities (65 ~ 
68%) and similarities (78~80%) than those observed in the 
stress protein BiP (Table IV). The human-associated bacteria 
include both commensals and pathogens that colonize the 
respiratory tract, urinary tract, or gastrointestinal tract. Given 



Table IV. Selected human-associated bacteria that contain human G6PI-homologous proteins 


Bacteria (Human association) 


Protein 


Query cover (%) 


Identities (%) 


Similarities (%) 


Haemophilus sputorum (sputum, throat swab, blood) 


G6PI 


95 


68 


80 


Providencia rettgeri (gut, opportunistic infection) 


G6PI 


97 


67 


79 


Yersinia pseudotuberculosis* (enteritis) 


G6PI 


97 


66 


78 


Haemophilus parahaemolyticus (pharyngitis) 


G6PI 


98 


66 


78 


Escherichia fergusonii (urinary tract infection) 


G6PI 


98 


65 


79 


Escherichia coli* (gut, urinary tract) 


G6PI 


98 


65 


79 


Haemophilus haemolyticus (nasophaynx) 


G6PI 


95 


65 


80 


Haemophilus influenza* (pneumonia) 


G6PI 


96 


65 


79 


Serratia marcescens* (urinary tract infection) 


G6PI 


97 


66 


78 


Salmonella enteric* (enteritis) 


G6PI 


98 


65 


78 


Shigella flexneri* (enteritis) 


G6PI 


98 


65 


78 


*Bacteria that have been reported to develop reactive arthritis, septic arthritis, or rheumatic symptoms. 




Table V. Selected human-associated bacteria that contain humar 


i ff-enolase-homologous proteins 






Bacteria (Human association) 


Protein 


Query cover (%) 


Identities (%) 


Similarities (%) 


Fusobacterium sp. CAC:439 (gut) 


Enolase 


97 


55 


72 


Clostridium sp. CAG-.306 (gut) 


Enolase 


97 


55 


71 


Treponema denticola (oral cavity, periodontitis) 


Enolase 


99 


54 


70 


Turicella otitidis (otitis) 


Enolase 


97 


53 


71 


Bacillus smithii (gut) 


Enolase 


98 


53 


72 


Propionibacterium sp. KPL1844 (Human microbiome project) 


Enolase 


97 


55 


70 


Propionibacterium acnes (skin, gut) 


Enolase 


96 


55 


70 


Actinomyces sp. Oral taxon 1 78 (oral cavity) 


Enolase 


97 


55 


69 


Neisseria sicca (oral cavity) 


Enolase 


97 


53 


69 


Clostridium botulinum (food-born infection) 


Enolase 


97 


52 


71 


Acinetobacter sp. NIPH 542 (gut) 


Enolase 


97 


52 


69 


Bacillus cereus* (infection in gut, eye, lung) 


Enolase 


98 


52 


71 


Brevibacillus brevis (peritonitis) 


Enolase 


97 


51 


70 


Neisseria meningitidis* (nasopharynx, meningitis) 


Enolase 


97 


53 


69 



*Bacteria that have been reported to develop reactive arthritis, septic arthritis, or rheumatic symptoms. 
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Table VI. Selected human-associated bacteria that contain human hnRNP A2-homologus proteins 



Bacteria (Human association) 


Protein 


Query 
cover (%) 


Identities 

(%) 


Similarities 

(%) 


Variovorax paradoxus (oral cavity) 


Rnp-1 like RNA-binding protein 


22 


34 


54 


Acidovorax avenae (catheter-related infection) 


Rnp-1 like RNA-binding protein 


17 


38 


56 


Delftia acidovorans (catheter-related infection) 


Rnp-1 like RNA-binding protein 


17 


38 


56 



the high degree of similarity between the human and bacterial 
G6PIs shown in Table IV, it can be speculated that a large 
body of human-associated bacteria, including those in the 
normal flora, may share significant homology (greater than 
30%) in their G6PIs with the human protein. K/BxN mice 
spontaneously develop autoantibodies to G6PI and auto- 
immune arthritis (25). The serum autoantibody titer and auto- 
immune arthritis in the K/BxN mice are significantly attenu- 
ated under germ-free conditions, which are reinstated by the 
introduction of a single gut-residing species, asegmented fila- 
mentous bacteria (26). A homology search revealed that the 
G6PI of the segmented filamentous bacteria Candidatus 
Arthromitus sp. SFB-mouse-Japan has 40% homology with a 
mouse G6PI, supporting the role of this bacterium in an- 
ti-G6PI antibody production. 

The ff-enolase is another glycolytic enzyme ubiquitously 
expressed in the cytosol. It is also expressed on the surface 
of stimulated leukocytes, and then serves as a plasmi- 
nogen-binding receptor, which assists in inflammatory cell in- 
vasion (27). The similar usage of enolase and plasminogen 
for the invasion of host tissue has been demonstrated by sev- 
eral pathogens (28). Interestingly, members of the normal 
flora in the oral cavity and gut express the enolase that shares 
a high degree of identity (51 — 55%) and similarity (68~72%) 
with the human ff-enolase (Table V). The enolases of several 
important human pathogens such as Treponema denticola, 
Turicella otitidis, Clostridium botulinum, Bacillus cereus, and 
Neisseria meningitidis also have high degrees of homology 
with the human ff-enolase. 

The hnRNP A2 is an abundant RNA-binding protein that 
is predominantly expressed inside the nucleus and involved 
in pre-mRNA splicing, mRNA transport, and translation (29). 
Although bacteria do not make mRNA, a homology search 
using the hnRNP A2 retrieved 6l bacterial proteins that share 
27-49% identities and 51~73% similarities with the first RNA 
recognition motif (RRM) superfamily domain of hnRNP A2. 
Indeed, all retrieved bacterial proteins were RNA-binding 
proteins. Among them, Variovorax paradoxus is a member 



of the human oral flora, and Acidovorax avenae and Delftia 
acidovorans are involved in catheter-related infection (Table 
VI). 

Collectively, bacterial orthologs exist for almost all auto- 
antigens characterized in RA. It is remarkable that most of 
the pathogenic bacteria listed in Table II ~V have been re- 
ported to develop reactive arthritis, septic arthritis, or rheu- 
matic symptoms (30-38). Furthermore, there is evidence for 
the possible involvement of Clostridium spp. , and Prevotella 
spp. in the pathogenesis of RA (39). Accumulating evidence 
suggests that not only infectious but also indigenous micro- 
organisms may be involved in the initiation and perpetuation 
of RA (39). Increased epithelial permeability, loss of immune 
tolerance, and trafficking of both microbial components and 
activated immune cells to the joints have been suggested as 
underlying mechanisms for the involvement of the indigenous 
bacteria (39). 

CONCLUSION 

We explored seven autoantigens for the presence of evolu- 
tionary conserved counterparts in the bacterial protein 
database. Of the seven autoantigens, PR3, CII, BiP, G6PI, a- 
enolase, and hnRNP A2 have well conserved bacterial 
orthologs. Importantly, those bacterial orthologs are also 
found in human-associated bacteria. Although there are no 
bacterial orthologs for human immunoglobulin gamma- 1 
heavy chain constant region, proteins with an immunoglob- 
ulin superfamily domain are found in bacteria. The wide dis- 
tribution of the highly conserved DnaK, G6PI, or enolase 
among the members of the normal flora and common in- 
fectious microorganisms rather raises the question on how 
cross-reactive autoantibodies are not produced during the im- 
mune response to these bacteria in most healthy people. 
Understanding the mechanisms that deselect auto-reactive B 
cell clones during the germinal center reaction to homologous 
foreign antigens may provide a novel strategy to treat auto- 
immune diseases. 
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